Detection of Breast Cancer using Data Mining Tool (WEKA)

Author

  • Jyotismita Talukdar
Abstract

Breast cancer is a leading cause of death among women in developed countries and the second most common cause of cancer death in women worldwide. Its incidence has increased significantly over the last few decades. In this paper we discuss various data mining approaches that have been used for early detection of breast cancer. Breast cancer diagnosis means distinguishing benign from malignant breast lumps, and we approach it using data mining techniques. Data mining is an essential step in the process of knowledge discovery in databases, in which intelligent methods are applied to extract patterns. The most effective way to reduce breast cancer deaths is to detect the disease early. This paper treats early detection in three major steps: (i) collection of the data set, (ii) preprocessing of the data set, and (iii) classification. Classification is among the most essential tasks in data mining and machine learning. Many experiments have been performed on medical datasets using multiple classifiers and feature selection techniques, and a good amount of research on breast cancer datasets is found in the literature, much of it reporting good classification accuracy. For classification we have chosen J48. All experiments are conducted in the WEKA data mining tool. The datasets are collected from online repositories and consist of records from actual cancer patients.

Keywords: Breast Cancer, Data Mining, WEKA, J48 Decision Tree, ZeroR
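The classification step named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's WEKA workflow. It assumes a tiny invented table of nominal attributes (`ROWS`) and shows only the two ideas behind the classifiers listed in the keywords: a ZeroR majority-class baseline, and the gain-ratio criterion that C4.5 (the algorithm J48 implements) uses to choose a split. Pruning, numeric attributes, and missing values are left out, and all function names are hypothetical.

```python
from collections import Counter
from math import log2

# Hypothetical toy records: (clump_thickness, cell_size, label).
# These values are invented for illustration; they are not patient data.
ROWS = [
    ("low", "small", "benign"),
    ("low", "small", "benign"),
    ("low", "large", "malignant"),
    ("high", "small", "benign"),
    ("high", "large", "malignant"),
    ("high", "large", "malignant"),
    ("low", "small", "benign"),
    ("high", "small", "benign"),
]


def zero_r(labels):
    """ZeroR baseline: always predict the most frequent class."""
    return Counter(labels).most_common(1)[0][0]


def entropy(values):
    """Shannon entropy (in bits) of a list of nominal values."""
    total = len(values)
    return -sum((n / total) * log2(n / total) for n in Counter(values).values())


def gain_ratio(rows, attr_index):
    """Information gain of one nominal attribute divided by its split information."""
    labels = [row[-1] for row in rows]
    groups = {}
    for row in rows:
        groups.setdefault(row[attr_index], []).append(row[-1])
    total = len(rows)
    # Weighted entropy of the class labels after splitting on the attribute.
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    gain = entropy(labels) - remainder
    split_info = entropy([row[attr_index] for row in rows])
    return gain / split_info if split_info > 0 else 0.0


labels = [row[-1] for row in ROWS]
baseline = zero_r(labels)
baseline_accuracy = labels.count(baseline) / len(labels)
# Attribute a C4.5-style tree would test first at the root.
best_attr = max(range(2), key=lambda i: gain_ratio(ROWS, i))

print(baseline, baseline_accuracy, best_attr)
```

Comparing a tree against the ZeroR baseline shows whether the classifier has learned anything beyond the class distribution. On this toy table the second attribute separates the classes perfectly, so it is the one chosen for the root split.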

Similar resources

Detection of Breast Cancer Progress Using Adaptive Nero Fuzzy Inference System and Data Mining Techniques

Prediction, diagnosis, recovery, and recurrence of breast cancer among patients have always been among the most important challenges for researchers and scientists. Nowadays, with the help of bioinformatics, these challenges can be addressed using information from previous patient records. This paper uses an adaptive neuro-fuzzy inference system and data mining techniqu...

Comparing the Performance of Data Mining Tools: WEKA and DTREG

The objective of the paper is to compare two data mining tools on the basis of various estimation criteria. The tools evaluated are WEKA and DTREG. They are used to build a multilayer perceptron, a data mining model, to predict the survivability of oral cancer patients. An oral cancer database is considered, as it is estimated to be 8th most common cancer world...

An Efficient Prediction of Breast Cancer Data using Data Mining Techniques

Breast cancer is one of the major causes of death in women when compared to all other cancers. It has become one of the most hazardous types of cancer among women in the world. Early detection of breast cancer is essential in reducing loss of life. This paper presents a comparison among different data mining classifiers on the Wisconsin Breast Cancer (WBC) database, by u...

Mining Big Data: Breast Cancer Prediction using DT - SVM Hybrid Model

Breast cancer is becoming a leading cause of death among women worldwide; meanwhile, it is confirmed that early detection and accurate diagnosis of this disease can ensure long survival of patients. This paper presents a disease status prediction employing a hybrid methodology to forecast the changes and its consequence that is crucial for lethal infections. To alarm the...

Decision Tree Studies in Estimating Breast Cancer Risk Using Single Nucleotide Polymorphisms

Abstract. Introduction: Decision trees are data mining tools for collecting, accurately predicting, and sifting information from massive amounts of data, and they are widely used in computational biology and bioinformatics. In bioinformatics they can be used to predict diseases, including breast cancer. The use of genomic data including single nucleotide polymorphisms is a very important ...

Publication date: 2015